Transcribing Multi-instrument Polyphonic Music with Transformed Eigeninstrument Whole-note Templates
نویسندگان
چکیده
We present a system for the transcription of polyphonic music recordings to recover both the notes played and the instruments responsible for each note. In our framework, the spectrogram of the music is viewed as the superposition of note events, each characterized by an onset time and pitch, an instrument (described by a vector of eigeninstrument weights that combine instrument model bases to match a particular source in the mixture), and per-note transformation parameters that take a durationand decaynormalized note template and extend it to match the actual duration and dynamics of each individual note. Transcription is achieved through an EM-like iterative estimation scheme. Initializing this estimation using a rough separation of sources from a frame-based transcription system gives stable and accurate results that directly describe the audio at a note, rather than a frame, level, with each note attributed to a particular instrument. This approach significantly improves transcription accuracy over a frame-level system, apparently because the transcription constrains each note to obey the dynamics encoded in the templates. Notelevel transcription accuracy on real woodwind excerpts from the MIREX Multiple F0 evaluation improves from 64% (frame-level) to 67%; for the more temporally-structured notes in the RWC piano examples, accuracy improves from 70% to 79%, with dramatic reductions in false alarms.
منابع مشابه
Adaptive oscillator networks for partial tracking and piano music transcription
This paper presents our recent work in developing a system for transcription of polyphonic piano music. Our goal is to build a system that would automatically transcribe polyphonic piano music from the audio signal, transcribing note onsets and offsets. The system consists of three main stages: filtering, partial tracking and note extraction. The paper presents our partial tracking method based...
متن کاملInstrument Recognition in Polyphonic Mixtures Using Spectral Envelopes
Instrument recognition in polyphonic music is a difficult task in computer audition. Many current methods approach this problem by first attempting to separate the timbre features among the sources present in the mixture using source separation, multi-pitch estimation, or noteonset techniques. Instrument (timbre) recognition then proceeds on these separated features. This study proposes another...
متن کاملInstrument Identification in Polyphonic Music: Feature Weighting with Mixed Sounds, Pitch-Dependent Timbre Modeling, and Use of Musical Context
This paper addresses the problem of identifying musical instruments in polyphonic music. Musical instrument identification (MII) is an improtant task in music information retrieval because MII results make it possible to automatically retrieving certain types of music (e.g., piano sonata, string quartet). Only a few studies, however, have dealt with MII in polyphonic music. In MII in polyphonic...
متن کاملMultiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model.
A method for automatic transcription of polyphonic music is proposed in this work that models the temporal evolution of musical tones. The model extends the shift-invariant probabilistic latent component analysis method by supporting the use of spectral templates that correspond to sound states such as attack, sustain, and decay. The order of these templates is controlled using hidden Markov mo...
متن کاملFrom Raw Polyphonic Audio to Locating Recurring Themes
We survey several approaches to the task of transcribing polyphonic music on the diatonic scale and introduce some new ones. We do not address the issue of instrument identification at all; instead we limit our analysis to a single keyboard string instrument with discrete pitch such as piano or harpsichord. The second part of our work is concerned with comparison of musical sequences of polypho...
متن کامل